CDS

Accession Number TCMCG075C14706
gbkey CDS
Protein Id XP_007035526.2
Location join(30424433..30424540,30425213..30425311,30425534..30425596,30425763..30425840,30426268..30426457,30426682..30426941,30427024..30427402,30427864..30427941,30428928..30429148,30429288..30429350,30430119..30430442,30430529..30430707,30430785..30431067,30431339..30431575,30431769..30431930,30432336..30432488,30432583..30432801,30433159..30433458,30433575..30433739)
Gene LOC18603464
GeneID 18603464
Organism Theobroma cacao
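
The Location field above uses the feature-table `join(start..end,...)` notation, where each pair is a 1-based, inclusive exon segment on the forward strand. As a rough consistency check, the segment lengths can be summed and compared with the protein length given in this record. The sketch below is only an illustration: the names `LOCATION`, `parse_join` and `cds_length` are hypothetical and not part of any database tooling, and it assumes a simple forward-strand join with no `complement(...)` or partial (`<`, `>`) markers.

```python
import re

# Location string copied from the record above (forward strand, simple join).
LOCATION = (
    "join(30424433..30424540,30425213..30425311,30425534..30425596,"
    "30425763..30425840,30426268..30426457,30426682..30426941,"
    "30427024..30427402,30427864..30427941,30428928..30429148,"
    "30429288..30429350,30430119..30430442,30430529..30430707,"
    "30430785..30431067,30431339..30431575,30431769..30431930,"
    "30432336..30432488,30432583..30432801,30433159..30433458,"
    "30433575..30433739)"
)


def parse_join(location):
    """Return a list of (start, end) integer pairs from a join(...) string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(segments):
    """Sum segment lengths; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total_nt = cds_length(segments)
codons = total_nt // 3

print(len(segments), "segments,", total_nt, "nt,", codons, "codons")
```

For this record the check gives 19 segments totalling 3561 nt, i.e. 1187 codons: 1186 amino acids plus one stop codon, which agrees with the Length field in the Protein section.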

Protein

Length 1186 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007035464.2
Definition PREDICTED: protein ALWAYS EARLY 3 isoform X1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category K
Description Protein ALWAYS EARLY
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001
KEGG_ko ko:K21773
EC -
KEGG_Pathway ko04218, map04218
GOs -

Sequence

CDS:  
ATGGCGCCATCTAGAAAATCTAAAAGTGTAAATAAGAAGTTTTCTTATGTTAATGAGGTTGCTTCTAGTAAAGATGGAGATAGTAGTGCTAAGAGAAGCGGGCAACGGAAAAGGAAGTTGTCTGACATGTTAGGGCCTCAATGGACTAAGGAAGAGCTTGAGCGTTTCTATGAAGCGTATCGCAAGTATGGGAAAGATTGGAAGAAGGTTGCTACTGTGGTACGAAATCGATCTGTGGAAATGGTAGAAGCTCTGTACACTATGAATAGGGCCTACTTATCTCTCCCGGAAGGCACTGCTTCTGTGGTTGGACTCATAGCGATGATGACTGATCACTATTGTGTTATGGGAGGAAGTGATAGTGAACAAGAAAGCAATGAGGGCGTGGGAGCTTCTCGGAAACCTCAGAAGCGTAGTAGGGGAAAACTTCGAGATCAACCCTCTAAAAGTTTAGATAAGTCATTTCCTGATCTTTTGCAATTTCATTCAGCTGCATCAAGTTATGGTTGCTTGTCATTGTTGAAGAGGAGACGCTCTGAAAGTAGGCCCCGTGCTGTTGGAAAAAGGACTCCTCGTGTTCCTATTTCTTTTTCTCATGACAAAAACAAAGGAGAAAGGTACTTTTCACCTATTAGGCAGGGCATGAAACTAAAGGTGGATACCGTTGATGATGATGTTGCTCATGAGATAGCATTAGTTTTGACGGAGGCATCACAAAGAGGTGGATCTCCTCAAGTTTCTCGAACACCAAACAGAAAAGCAGAGGCATCTTCACCTATTCTCAACAGTGAAAGGATGAATGCTGAGTCAGAAACTACTAGTGCCAAGATTCATGGTAGTGAAATGGATGAGGATGCTTGTGAATTGAGCTTAGGAAGCACTGAAGCTGATAATGCTGATTATGCTAGAGGTAAAAATTATTCAATGAATATAGAAGGGACTGGTACCATTGAAGTTCAACAGAAGGGAAAAAGATACTACAGAAGGAAGCCAGGGGTTGAGGAAAGTGTAAACAATCATCTGGAAGACACAAAAGAAGCCTGTAGTGGGACGGAAGAAGATCAAAAGTTATGTGATTTCAAGGGAAAGTTTGAAGCAGAGGTTGCAGATACCAAACCTTCTAGAGGCTCCATCAAGGGTCTAAGGAAAAGAAGTAAAAAAGTGTTGTTTGGGAGAGTTGAAGACACTTCCTTTGATGCCCTGCAAACTCTAGCAGATCTGTCCTTGATGATGCCAGAAACTGCTGCTGATACTGAGTCATCTGTGCAGTTCAAGGAAGAGAAAAATGAAGTTGTTGAGAAGACTAAACTGAAAGGAAACCATCCTGTTTCTGGAGCTAAAGGCACTGCCCCCAAAACATGTAAACAGGGAAAAGTTTTTGGTCATGATGTTCGTGCTATTCCCGAGGCAAAGGAGGAAACACACCCAGGTAATGTTGGAATGCGGAAAAGGAGACAGAAGTCCTCACCATATAAATTGCAGATTCCAAAAGATGAAACTGATGCTGATTCTCATTTGGGTGAATCTCGAAACATTGAGGCTTTAGATGAGGTAAAGAATTTTCCAAGCAAAGGTAAACGCTCTAATAATGTTGCACATTCAAAGCAAGGGAAATCAGTGAGACCTCCAGAGCATCGTTCCTCAAGTACTGATCATGGAAGGGACTTGAACAATTCAGCTCCATCTACCATACAGGTTTCACCTGTTAACCAGGTCAACCTACCCACAAAAGTCAGGAGTAAGAGAAAGATAGATGCACAGAAACAAGTGATTGGGAAGGATATAAAGTCCTCTGATGGTATTGTGAAGGGAAAATTTAGTGTTCCAGTTAGTTTATTCCATGACAGAGCACTCAATCTGAAGGAAAAGCTTTGTAACTTCCTATGTCCATATCAAGCACGGAGATGGTGTACCTTTGAGTGGTTCTGTAGTACAATTGATTATCCATGGTTTGCTAAAAGGGAGTTTGTGGAGTATTTGGATCATGTAGGATTGGGTCA
TGTTCCAAGATTAACTCGTGTTGAATGGGGTGTCATAAGGAGTTCCCTTGGCAAGCCACGAAGGTTTTCTGAGCAATTTTTGAAGGAAGAAAGAGAGAAGCTTTATCAATATCGGGAATCTGTTAGAACGCATTATGCTGAACTCCGTGCTGGTATTGGTGAAGGACTTCCAACTGATTTAGCTCGACCTCTATCAGTTGGACAGCGTGTTATTGCTATTCATCCAAAAACTAGAGAGATTCATGATGGAAATGTGTTAATTGTTGACCATAGTAGGTACCGGATTCAATTTGACAGCACTGAGCTAGGAGTGGAATCTGTCATGGATATTGATTGTATGGCTTTAAATCCATTGGAAAATTTGCCTGCTTCCCTTGTGAGACAAAATGCTGCTGTCAGGAAATTTTTTGAGAACTACAATGAGCTCAAAATGAACGGGCAGCCAAAAGAAAGCAAGATGGAAGAGAACATCAAATTTGCTTCGTGTGAGGAGAATGCCAATAGTCCCTCTCGAACTTCCCCATCAACTTTCAGTGTTGGCAATTTATCACAACTTGTTAAGGTTGATCCATCAAGTCCTAATTTACAACTTAAAGTTGGGCCTATGGAAACTGTTTATACTCAGCAGGCAGTAAATTCCCAGCCTTCTGCTCTGGCGCTGATACAGGCGAGGGAAGCTGATGTTGAAGCTCTTTCTCAGTTGACTCGTGCTCTTGACAAAAAGCATTTGCAGGAGGCTGTGGTCTCTGAACTACGGCGTATGAATGATGAGGTGTTGGAAAACCAGAAAGGTGGGGACAACTCTATAAAGGATTCAGATTCTTTCAAGAAGCAATATGCTGCTGTTCTTTTACAGTTAAATGAAGTCAATGAGCAGGTTTCTTCTGCTCTCTTTTCCTTGAGGCAACGCAATACATATCAAGGGACCTCCTCAGTTAGATTGCTGAAGCCCTTGGCTAAAATTGGTGAGCATGGTTGTCAGTTGAGCTCTTTTGATCATTCTATGCATCATGCCCAAGAATCTGTATCCCATGTGGCTGAAATTGTTGAAAGTTCAAGAACGAAAGCTCGGTCAATGGTGGATGCAGCTATGCAGGCTATGTCATCCTTGAGAAAAGGGGGGAAAAGCATCGAGAGGATTGAGGACGCAATAGATTTTGTAAATAACCAGCTTTCGGTGGATGATCTTAGTGTGCCTGCTCCGCGGTCTTCTATCCCAATAGACTCAGCCCACAGTACGGTAACTTTTCACGATCATCTCACTGCCTTTGTGTCAAATCCACTGGCAACTGGTCATGCACCTGATACAAAGTTGCAAAATTCGTCTGACCAAGACGATCTTAGAATCCCTTCAGACCTTATCGTGCATTGTGTAGCCACCTTGCTCATGATTCAGAAGTGTACAGAAAGGCAGTTTCCACCTGGAGATGTTGCCCAGGTACTAGATTCTGCTGTTACTAGTTTGAAGCCGTGTTGTTCACAAAATCTCTCAATTTATGCAGAGATACAGAAATGTATGGGAATTATTAGGAACCAGATATTGGCGCTGGTACCTACATAG
Protein:  
MAPSRKSKSVNKKFSYVNEVASSKDGDSSAKRSGQRKRKLSDMLGPQWTKEELERFYEAYRKYGKDWKKVATVVRNRSVEMVEALYTMNRAYLSLPEGTASVVGLIAMMTDHYCVMGGSDSEQESNEGVGASRKPQKRSRGKLRDQPSKSLDKSFPDLLQFHSAASSYGCLSLLKRRRSESRPRAVGKRTPRVPISFSHDKNKGERYFSPIRQGMKLKVDTVDDDVAHEIALVLTEASQRGGSPQVSRTPNRKAEASSPILNSERMNAESETTSAKIHGSEMDEDACELSLGSTEADNADYARGKNYSMNIEGTGTIEVQQKGKRYYRRKPGVEESVNNHLEDTKEACSGTEEDQKLCDFKGKFEAEVADTKPSRGSIKGLRKRSKKVLFGRVEDTSFDALQTLADLSLMMPETAADTESSVQFKEEKNEVVEKTKLKGNHPVSGAKGTAPKTCKQGKVFGHDVRAIPEAKEETHPGNVGMRKRRQKSSPYKLQIPKDETDADSHLGESRNIEALDEVKNFPSKGKRSNNVAHSKQGKSVRPPEHRSSSTDHGRDLNNSAPSTIQVSPVNQVNLPTKVRSKRKIDAQKQVIGKDIKSSDGIVKGKFSVPVSLFHDRALNLKEKLCNFLCPYQARRWCTFEWFCSTIDYPWFAKREFVEYLDHVGLGHVPRLTRVEWGVIRSSLGKPRRFSEQFLKEEREKLYQYRESVRTHYAELRAGIGEGLPTDLARPLSVGQRVIAIHPKTREIHDGNVLIVDHSRYRIQFDSTELGVESVMDIDCMALNPLENLPASLVRQNAAVRKFFENYNELKMNGQPKESKMEENIKFASCEENANSPSRTSPSTFSVGNLSQLVKVDPSSPNLQLKVGPMETVYTQQAVNSQPSALALIQAREADVEALSQLTRALDKKHLQEAVVSELRRMNDEVLENQKGGDNSIKDSDSFKKQYAAVLLQLNEVNEQVSSALFSLRQRNTYQGTSSVRLLKPLAKIGEHGCQLSSFDHSMHHAQESVSHVAEIVESSRTKARSMVDAAMQAMSSLRKGGKSIERIEDAIDFVNNQLSVDDLSVPAPRSSIPIDSAHSTVTFHDHLTAFVSNPLATGHAPDTKLQNSSDQDDLRIPSDLIVHCVATLLMIQKCTERQFPPGDVAQVLDSAVTSLKPCCSQNLSIYAEIQKCMGIIRNQILALVPT
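
The CDS and Protein entries above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1), which is the usual choice for nuclear plant genes but is not stated in this record. The function name `translate` and the table construction are illustrative, not taken from any particular library. Whitespace is stripped first because the CDS text may be wrapped across lines.

```python
# Standard genetic code (assumed: NCBI translation table 1), in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(cds):
    """Translate a CDS string to protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # X for unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First nine codons and last four codons of the CDS in this record.
print(translate("ATGGCGCCATCTAGAAAATCTAAAAGT"))
print(translate("GTACCTACATAG"))
```

The two fragments translate to `MAPSRKSKS` and `VPT`, matching the start and end of the protein sequence listed above. Passing the full CDS text to `translate` and comparing the result with the Protein entry would extend the same check to the whole record.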